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FEATURE CONSTRAINT BASED DOCUMENT 
INFORMATION RETRIEVAL AND DISTRIBUTION 

The present invention relates to data processing, and more particularly relates to the transfer 
between computing devices, and the retrieval by such devices, of document related information. Even 
more particularly, the invention concerns such information retrieval and transfer using feature constraints. 

While the use of portable computing devices is becoming more widespread, it is as yet not 
possible to store in such devices the electronic files for all the documents that a user may wish to have 
access to (e.g. due to storage capacity limitations), and the transfer of bulk documents between such 
devices, or between one such device and a desktop office or home computer may be time consuming or 
otherwise user unfriendly. At the same time, there is growth in the provision of electronic document 
repositories and expanding use of document transfer via the internet. 

This situation is addressed in EP-A-691,619 (hereafter "EP'619'*). which discloses a system 
including any number workstations, file servers, printers and other fixed devices (including multifunction 
devices) coupled in a network, and a number of portable devices (e.g. handheld or wristwatch computer) 
carried by users and coupled to the network by infrared (IR) link. Each portable device emulates its user's 
personal satchel for documents: the device is programmed to receive transmit and store document 
references (World Wide Web URLs), each of which is associated with an electronic document stored in an 
electronic repository at a site on the web. Documents are distributed from one person to another by 
transmission of document URLs, and a document is sent to a printer by beaming that document's URL to 
an IR transceiver associated with that printer. 

A problem with such a system is that it stores and distributes links (Web URLs) only to single 
documents: each document reference points to a particular document. A further problem is how to keep 
track of a continually-changing electronic document repository, to dynamically adjust to changing 
document descriptions (updated author list, updates dates, updated keywords, etc.) For example, in the 
device of EP'619 the user may be carr/ing around a URL for a document A written by an author B on the 
subject of topic C. Unknown to the user, there may be in existence author B's updated document A', 
written 2 years after A, on the same topic C, or there may be a further document A" by the same author 
and on a topic closely related to C (and, for example, having a keyword in common). The user does not 
have the link to A' or A" and, even though they would be of use to him, without carrying out a further 
searching/browsing exercise to look for them and retrieve the URLs, he would continue about his work 
oblivious of them. 

It would be desirable to have a system which avoided the lack of dynamicity of existing 
.stems, and which enabled users to have to hand pointers to a set of documents which may be 
-mically changing. 

The present invention provides a method carried out in a data processing device including a 
memory, and a user interface, comprising: (a) receiving at least one user input, the user 
'ng at least one relation, the or each relation defining a document related entity and a 



property of the entity, (b) compiling a feature constraint, the feature constraint including the relation(s) 
received in step (a). 

The present invention further provides a data processing device when suitably programmed 
for carrying out the method of any of the appended claims, the device comprising a processor, a memory, 
and a user interface. 

The present invention further provides a data processing device, system or apparatus 
according to claims 19-22 of the appended claims. 

The present invention further provides a data processing device comprising: a processor, a 
memory coupled to the processor, and a user interface coupled to the processor and to the memory and 
adapted to be operable by a user to generate user inputs, means for receiving at least one user input, the 
user input(s) defining at least one relation, the or each relation defining a document related entity and a 
property of the entity, and for compiling a feature constraint, the feature constraint including the received 
relations. ■ 

The aforementioned problems are overcome by using a constraint description: that is, instead 
of using a pointer to a single document or to a static set of documents, feature constraints relating to 
documents are stored and distributed, the feature constraints providing pointers to a dynamiic set of 
documents satisfying the constraints given. 

An advantage is apparent from a consideration of an example, Supposing the feature 
constraint constrains only the author name (e.g. Smith) and the date of the document (e.g. >1995), then 
this will point to any document authored by Smith later than 1995. If, then, in 1998, Smith authors another 
document, this document is automatically linked with the 'old' constraint, without further user intervention. 

A further benefit is in the transfer of document reference,s between the portable devices of 
multiple users when the users (for example in a meeting at a remote location) are away from their office 
workstations. If the sender knows of there is a document in existence that he'd like to send, but doesn't 
have the URL on his device, or cannot recall how to identify it on his device and therefore cannot send the 
URL. but he knows an author's name, a date (range) of creation, and/or a keyword, he can thus send a 
feature constraint to the recipient. The recipient can then use the feature constraint to retrieve the 
document on his desktop workstation once he returns to his office. 

Embodiments of the invention will now be described, by way of example, with reference to 
the accompanying drawings, in which; 

Figure 1 illustrates schematically a data processing network which may be used to implement 
an embodiment of the invention; 

Figure 2 shows a portable computing device used in accordance with an embodiment of the 

invention: 

Figure 3 illustrates schematically the scope defined by a signed feature constraint: 
Figure 4 is a schematic flow chart of the steps in compiling a feature constraint; 
Figure 5 is a schematic diagram of the user interface of a fixed computing device at onf 
instant during the entry by a user of a query; 



Figure 6 is a schematic diagram of the user interface of a portable computing device at one 

instant; 

Figure 7 is a schematic flow chart of the steps in sending a feature constraint from one 
(portable/fixed) device to another (portable/fixed) device; 

Figure 8 is a schematic illustration of a data packet encoding a feature constraint; 

Figure 9 is a schematic flow chart of the steps in using a feature constraint to retrieve 
document references and display or print corresponding documents 

Figure 1 0 illustrates a portion of a list of hits obtained during the process of Fig. 9; 

Figure 11 shows selected hits from the list of Fig. 10 after transformation into HTML format; 

and 

Figure 12 illustrates a more detailed presentation of a single selected hit. 

1. System hardware 

It will be appreciated that the present invention may be implemented using conventional 
computer network technology, either using a local area network (LAN) or, more suitably, a wide area 
network (WAN). The invention has been implemented using conventional web browser software (e.g. 
Netscape) providing cross-platform communication and document transfer over the internet. This is 
schematically illustrated in Fig. 1. It will be appreciated that each machine 22, 24, 26 forming part of the 
network 21 may be a PC running Windows^", a Mac running MacOS, or a minicomputer running UNIX, 
which are well known in the art. For example, the PC hardware configuration is discussed in detail in The 
Art of Electronics, 2nd Edn, Ch. 10, P. Horowitz and W. Hill, Cambridge University Press. 1989. However, 
it will be appreciated that the invention may be implemented using different system configurations: see 
EP'619. 

Upon request of a user at machine 22 a document stored on machine 26 may be retrieved 
and sent from machine 26 over the internet, via any number of intermediate machine 24 to machine 22. As 
is well known, the document may be retrieved using as a unique identifier its World Wide Web URL, as 
discussed in EP'619. Preferably also connected to the network 21 are any number of printers or 
multifunction devices (capable of scanning/printing/faxing, etc.) (not shown), as discussed in EP'619. 
Multifunction devices are discussed in more detail in EP-A-741,487. Each machine coupled to the 
network is also equipped with appropriate hardware and software, which is known in the art, for 
communication with portable computing devices, such as personal digital assistants (PDAs), handheld 
PCs, or pocket or wristwatch computers. 

A preferred form of portable computing device is shown in Fig. 2. This device 2 is discussed 
in more detail in EP'619. However, it will be appreciated that the computing device may be fabricated in a 
multitude of forms: for example, the device 2 may be of any of the forms disclosed in European patent 

application EP-A- (attorney's reference D/95594/JDR), corresponding to U.S. application S.N. 

08^ _ ("Personal storage device for application and data transfer", filed 28 March 1996). In its 

preferred form, computing device 2, and each of the fixed machines on the network 21, are equipped for 
infrared communication and suitably, the data packets transmitted between the computing device 2 and 
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the fixed machines, and enabling the document(s) to be retrieved, conform to the physical and link layer 
formats (IrLAP) described in the industry standard Infrared Data Association (IrDA) specification, version 
1 .0. which is well known in the art, 

2. Knowledge brokers 

According to one embodiment of the invention, in retrieving a document for local or remote 
printing, or transfer to another machine, each machine in Fig. 1 employs brokers. 

Brokers are software agents which can process knowledge search requests. Knowledge is 
taken here to be any piece of electronic information intended to be publicly accessible. Different, possibly 
distributed, information sources are assumed to be available, from a simple file in a user's directory to a 
database local to a site, up to a wide area information service (WAIS) on the internet, for example. 

When receiving a request, a broker may have sufficient knowledge to process it. or may need 
to retrieve more knowledge. For that purpose, it releases sub-requests, aimed at other brokers. Thus, 
knowledge retrieval is achieved by the collaboration of ail the brokers which are alternatively service 
providers processing requests and clients of these services generating sub-requests. The infra-structure 
required to support such collaboration, and the' way knowledge is stored locally within each broker is not 
discussed in detail here (for details of a suitable model, the reader is referred to Andreoli et al (1996), The 
Constraint-Based Knowledge Broker Model: Semantics, Implementation and Analysis, J. Symbolic 
Computation). The following discussion addresses rather the knowledge manipulations occurring within 
each broker. 

In order to collaborate, the brokers must at least understand each other. This means that all 
the requests must be formulated in a common language (and also all the answers to the requests), even if 
the brokers may perform local translations. Logic provides the adequate language for such a purpose. A 
request can be expressed by a pair (x, P) where x is a logical variable and P a logical formula involving x, 
meaning ''Retrieve knowledge objects x such that the property expressed by formula P holds". 
Interestingly, an answer to such a request can be expressed in the same formalism, i.e. a pair (x, Q) 
meaning 'T/?ere exists a knowledge object x satisfying the property expressed by formula Q". The 
requirement here is that P must be a logical consequence of Q, so that the answer contains at least as 
much knowledge as the request. Moreover, the same logical formalism can be used to capture the scope 
of a broker, i.e. the area of knowledge it is concerned with: a broker with scope (x, R) means '7 am not 
capable of retrieving knowledge objects x which do not satisfy the property expressed by formula R\ In 
many situations, the scope of a broker may vary, because it gets specialised or. on the contrary, expands 
its capacities, either externally or due to the knowledge retrieval process itself. 

In other words, logic provides a common language where both requests, answers and scopes 
can be expressed. Brokers then perform logical operations on these three components. The most 
important logical operation, from which all the others can be reconstructed, is satisfiability checking, i.e. 
deciding whether some object could satisfy the property expressed by a formula, or, on the contrary, 
whether it is intrinsically contradictory. Unfortunately, it is well known that this operation, for full Classical 
Logic. IS not algorithmic, i.e. it is provably impossible to write a program which implements it and always 



terminates. Given this limitation, a lot of research in knowledge representation has been focused on 
identifying fragments of Classical Logic in which satisfiability is algorithmically decidable. The trade-off 
here is between expressive power and tractability; the empty fragment, for example, is obviously tractable, 
but it is not very expressive. 

The most popular fragment which emerged is known as "feature constraints" (FCs). The 
satisfiability problem in this case is also known as "feature constraint solving". 

As is known, feature constraints are built from atomic constraints which are either sorts or 
features. A sort is a unary relation, expressing a property of. a single entity. For example, P: person 
expresses that an entity p is of sort person. A feature is a binary relation expressing a property linking 
two entities. For example, p : employer- >e expresses that entity p has an employer, which is an entity e. 
Apart from sorts and features, most feature systems also allow built-in relations such as equality and 
disequality. 

3. Principles of feature constraints 

The full fragment of feature constraints, where the atomic components mentioned above are 
allowed to be combined by all the logical connectives (conjunction, disjunction, negation and quantifiers), 
although very expressive, is hardly tractable. A subfragment called "basic feature constraints" (BFC) has 
been considered, where negation and disjunction are simply forbidden. Efficient constraint solving 
algorithms have been proposed for this sub-fragment. However, a drawback is that the complete absence 
of negation puts strong limitations on the kind of operations a knowledge broker may wish to perform. 

In preferred embodiments, the present invention makes use of a powerful operation, referred 
to as "scope-splitting", which relies on the use of negation. Indeed, a'broker may wish to split its scope, 
specified by a pair (x, P) according to a criterion expressed by a formula F, thus creating two brokers with 
scope P A F and P a-, F. Thus, a broker in charge of bibliographic information may wish to split its scope 
into two new scopes: "books written after 1950", which can be represented by the BFC 

X 

X : book 

X • : year - > Y 

Y > 1950 

and its complement, i.e. "books written before 1950 or documents which are not books"; this latter scope 
cannot be expressed using BFC, because negation and disjunction cannot be dispensed with. It has been 
discovered that the scope splitting operation is useful in many situations, for example to implement 
brokers capable of memorising and re-using information gathered during their lifetime.- Embodiments of 
the present invention make use of, on the one hand, a fragment of feature constraints, called "signed 
feature constraints" (SFC), which allows limited use of negation, precisely capable of expressing the kind 
of split scope mentioned above, and on the other hand, an efficient constraint solving method for SFC. 
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3.1 Signed Feature Constraints 

A signed feature constraint is composed of a positive part and a list of negative parts, both of 
them being basic feature constraints. For example, the following signed feature constraint 
p 



+ p 


: person. 


p 


: employer -> E, 


E : 


: "Xerox" 


- P ; 


: nationality-> N, 


N : 


"American" 


- P : 


: spouse -> P' 


P ' : 


: person 


P • : 


employer - > E ' 


E ' : 


; "Xerox" 



specifies a Xerox employee who is not American and is not married to another Xerox employee. This is 
represent it graphically as in Fig. 3. The round boxes denote the entities (logical variables), the sort 
relations (unary) are represented by dashed arrows labelled by the name of the sort in a square box, the 
feature relations (binary) are represented by plain arrows labelled by the name of the feature in a square 
box. The built-in predicates (not present in the example) are represented by rhombuses. The positive part 
of the SFC is contained in the top box and marks the distinguished entity of the scope (p in the example) 
by a double round box. The negative parts of the SFC are contained in the lower boxes in grey. 
The main interest of SFC comes from the following property; 

if the scope of a broker is represented by an SFC eo, and this scope is split by a BFC e, then 

the two resulting split scopes e", e" are both SFC. 
Indeed, e* is obtained by merging the positive part of e^, with the BFC e; and e* is obtained by extending 
with a new negative part containing e alone. For example, assume a broker in charge of a bibliographical 
database containing various documents (books, videos etc.) about Art, but not authored by an American. 
It is represented by the SFC 

X 

+X : topic-> T 

T : "Art" 

-X ; author-> A 

A : nationality- > N 

N : "American" 

It may be split by the constraint "books written after 1950", expressed by the BFC 

X 

X : book 

X : year-> Y 

Y > 1950 



The resulting scopes are simply 

X 

+x : book 

X : topic-> T 

X : year-> Y 

T : "Art" 

Y > 1950 

-X : author-> A 

A : nationality-> N 

N : "American" 

i.e. "Art books written after 1950 but not by an American author" and 



X 




+ X : 


topic-> T 


T : 


"Art" 


-X : 


author- > A 


A : 


nationality-> N 


N : 


" American " 


-X : 


book 


X : 


year-> Y 


Y > 


1950 



i.e. "Art documents not authored by an American but not books subsequent to 1950". 

3.2 Solving Signed Feature Constraints 

Most constraint systems make a number of assumptions on the nature of sorts and features, 
called the axioms of the systems. These axioms are crucial to the satisfiability algorithm, since they 
determine when a feature constraint is contradictory and when it is satisfiable. 

3.2.1 Feature Axioms 

For the purpose of simplicity, the embodiment disclosed here makes use of a slight variant of 
the basic axiom system used in ATt-Kaci H. a/. (1994), A Feature-Based Constraint-System for Logic 
Programming with Entailment, Theoretical Computer Science 122. pp. 263-283. although it will be 
appreciated by persons skilled in the art that the principles of the method apply to other sets of axioms as 
well. 

1 . Features are functional: this means that if two pairs of entities which are constrained by the 
same feature have the same first term, they also have the same second term. For example, it can be 
considered that the feature spouse is functional (within a specific cultural setting), meaning that a 
person cannot have two spouses: if, for a person x, we have X: spouse ->y and X: spouse ->z, then 
the entities y and z coincide (i.e. denote the same person). Other systems allow multi-valued features. 
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2. Sorts are disjoint: this means that no entity can be of two distinct sorts. For example, a 
book is not a person: we cannot have an entity x with X:book and X: person. Other systems consider 
hierarchies of sorts where some entities can have multiple sorts as long as they have a common 
denominator in the hierarchy. 

3. There is a distinguished subset of sorts, called "value" sorts, so that no two distinct entities 
can be of the same value sort. Traditional basic elements (strings, numbers, etc.) are typical value 
sorts: for example, the string "Xerox" or the number 1950 are value sorts. Value sorts are not 
allowed to have features: this is the only axiom connecting sorts and features. Other systems consider 
more refined connections between sorts and features. 

4. There is a distinguished built-in binary predicate, equality, with the traditional congruence 
axioms (which involve sorts and features). The axioms describing all the other built-in predicates are 
assumed to contain no mention of sorts and features. 

These axioms are formally written in section A: Axioms in the Appendix at the end of this disclosure. They 
form a theory T. 

3.2.2 Constraint Satisfaction 

First, it is assumed that satisfiability over built-in predicates is decidable. This means that 
there is an algorithm which, given a formula F using only built-in predicates (F is also called a built-in 
constraint), can decide whether F is a logical consequence of the theory T (written \-t F). 

Constraint satisfaction over BFCs is defined by a set of conditional rewrite rules over BFCs 
(section 6.1 of the Appendix) which have the following properties 

(a) The system of rules is convergent and hence defines a "normal form" for BFCs. This can 
be shown in a classical way by proving that the system is "Church-Rosser" (critical pairs converge) 
and "Noetherian" (the size of the terms strictly decrease by rewriting), 

(b) A BFC is satisfiable if and only if its normal form is not reduced to the contradiction. One 
implication can be proved by showing that rewrite steps preserve satisfiability. The reverse implication 
can be proved by displaying a model which satisfies BFCs whose normal form is not reduced to the 
contradiction. 

Thus the rewrite rules describe the steps of the constraint satisfaction algorithm. This 
algorithm always terminates because the system of rewrite rules is convergent. It is to be noted that the 
definition of the rules rely on satisfiability tests of built-in constraints, which has been assumed decidable. 
This means that the algorithm is modular and can accommodate any kind of built-in constraints as long as 
a proper built-in constraint satisfaction algorithm is provided. 

Using rewrite rules for constraint satisfaction algorithm can be implemented in a naive way in 
some symbolic language like Lisp or Prolog, or can be optimised, taking into account the properties of the 
specific built-in constraints which are used. 

The algorithm for constraint satisfaction over SFCs (section B.2 of the Appendix) can 
informally be described as follows. Given an SFC, its positive component is first normalised by the 
algorithm for BFCs. If the result is a contradiction, the whole SFC is unsatisfiable. OthenA/ise, the positive 
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component (normalised) is inserted in each of the negative components, which are then normalised by the 
algorithm for BFCs. If a resulting negative component has a contradictory normal form, it is eliminated, 
and if it has a tautological normal form the whole SFC is unsatisfiable. The normal form for SFCs thus 
obtained has the following property: 

An SFC is satisfiable if and only if its normal form is not reduced to the contradiction. As in 
the previous case, the difficult part of the implication can be proved using model theory. 

4. User transactions with feature constraints 

Figure 4 is a schematic flow chart of the steps in compiling a feature constraint. Initially. (s41) 
the multiple user inputs are received as the user specifies a query, e.g. "books or articles after 1990 in 
which the title contains "constraints" but does not contain "internet". As the inputs are received, relations 
(see section 2 above) are determined. Figure 5 is a view of the user interface of a fixed computing device 
at one instant during the entry by a user of a query: this is discussed further in British patent application 

97 (agent's ref. R/97005Q2/JDR), filed contemporaneously herewith. Then, once the user has 

finished inputting the elements of the query, the feature constraint is compiled (step s42) from the relations 
obtained. The user may be prompted (s43) to enter a short query name, or a default one may be 
assigned, and the FC stored in its final form. 

This inputting by a user may be performed on a portable computing device in a manner 
known in the art, using a keyboard, touchscreen user interface, displayed lists and scrolling/control 
buttons etc. If the user interface for the portable device is small, the method disclosed in EP-A-733,964 — 
for accessing members of a stored data set — may be used. A further alternative is for the user to input 
the query, from which the FC is obtained, on a piece of paper via a scanner: the user checks boxes on a 
pre-printed standard sheet (on which possible relations are already indicated in human-readable form); 
and when the completed sheet is scanned, the relations are extracted from the resulting image data and 
used to compile the FC. This alternative is discussed further in the aforementioned GB appln. (agent's 
ref. R/97005Q2/JDR), 

Figure 6 is a schematic diagram of the user interface 60 of a portable computing device at 
one instant. The general techniques disclosed in EP'619 may be used in presenting information to the 
user. As in EP'619, icons 61 and document names 62 may be displayed which correspond to stored 
URLs. However, there is also displayed an icon 63 and short query name 64 corresponding to a stored 
FC. Optionally, the user may view the long version, for example by selecting icon 63 and performing 
button presses corresponding to a double mouse click in a conventional user interface. In this case, this 
might display the long version 'Books by Babbage after 1993 with title including "lecture" and "Object- 
Oriented" \ 

Figure 7 is a schematic flow chart of the steps in sending a feature constraint from one 
(portable/fixed) device to another (portable/fixed) device. As described in EP'619, the computing devices 
are context sensitive: an icon for each other user's device in range of IR communication is displayed on 
the user interface (step s71). After the user has requested sending a FC to another user in range (s72), if 
the request concerns a new, as yet non existent FC, then the process of Fig. 4 is followed, otherwise the 
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FC is retrieved from memory (s73). Then the FC is encoded in appropriate form and broadcast using any 
of the communications methods mentioned in EP'619. A data packet in which the FC may be encoded in 
the data field is schematically illustrated in Fig 8, However, the preferred means of communication is IR 
signals, and in particular by means of IrDA standard data transfer 

Returning to Fig. 7, the recipient's user interface also displays icons for all in-range users 
(step s75). On receiving an IR data packet, this is decoded to extract the FC (s77) and then stored in the 
designated location in the recipient device directory for future use. if required (s78). 

Figure 9 is a schematic flow chart of the steps in using a feature constraint to retrieve 
document references and display or print corresponding documents. This may be performed by a 
conventional computer device, or by a multifunction device or printer equipped with a user interface. 
Initially, a FC is received from a user, for example in a data packet from a user of a portable device, as 
illustrated in Fig. 8, or by input directly into the machine by a user operating a keyboard and mouse, or 
touch screen, as is well known in the art. Upon the instigation of the user by appropriate input, the FC is 
solved as described in section 3 above, and the resulting request in the appropriate form passed to the 
search engine (s92). The search request is used to search all available repositories for documents 
satisfying the FC (s93); and if necessary, the request may be broken down into subrequests as'discussed 
in more detail in the aforementioned British patent application (agent's reference R/97005Q2). 

Once obtained by the search engine, a list of hits — of documents satisfying the FC — is 
displayed (s94), as shown in Fig. 10. Then, in response to appropriate user input, operations may be 
performed to display individual hits with expanded detail of the document, to convert the document 
information to HTML format, or to download the document from the repository {s95). 

Figure 11 shows selected hits from the list of Fig. 10 after transformation into HTML format. 
As can be seen, for each hit there is displayed further information, such as author name, http_url, 
information source and title. If desired, the user can view the document for hit 1 by mouse clicking on the 
http_url displayed. The document can then be printed, if needed (s96). 

Figure 12 illustrates a more detailed presentation of a single selected hit, i..e. with a set of 
attributes of the document. It can be seen that against one or more of the attributes are displayed URLs 
providing links to further pages providing information related to those attributes. 

It will be appreciated that in implementing the above described techniques, feature 
constraints in the form of signed feature constraints, as discussed in section 3, may be employed. 
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Appendix 



A Axioms of the System 

There are three sets of axioms. 

Specific axioms for featxires ajid sorts : 

Let r, denote any sorts, and / denote any feature. 

Vr -t(i : r A X : r') if r 7^ T-' 

Vz.y X : T Ay : r D X z= y ]f r IS a value sort . 

Vz. y ^(z :r Ax y) if r is a value sort 

Congruence axioms for equality : 

Let p denote any built-in predicate. The traditional congruence axioms 
are: 

Vz z = z 

Vz,y x~yDy-x 
-yxyy.z x~yAy = zDx = z 
Vz,y xirAx-yJy.r 

"ix.y.z x-LyAx^zDz-^y 

"ix.y.z X -LyAy- zZ> X ^ z 
Vz,y p{x)AXi-y3p{y) 

where i is some index in the list of variable z and y is identical to z except 
that yi = y. 

Built -m predicate axioms : 

They must not mention sorts and features. For example, disequality can 
be axiomatized by 

"^^^y ^"^yyx-y 

Vz -^(x ^ x) 
Precedence constraints are axiomatized by 

Vz -•(z < z) 

Vz, y, z z < y A y < z 3 z < z 
The built-in predicates >, <, > can then be defined from < and equality, 

B Constraint Satisfaction 
B.l The BFC case 

We represent a BFC as a pair {B \ T) where 5 is a built-in constraint and F an 
ztnorrferci list of sort and feature constraints (read conjunctively). L denotes 
the contradiction. 
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There axe two sets of rewrite rules. The following rules correspond to sim- 
plifications of the BFCs. 

{B\xty,z^t,r) ^ {B Ay = t \ X -L y,T) \f )-T B D z = z 

{B \x -.r.y.T.r) ^ {5lr:r,r) if l-r 5Dr=y and r is not a value sort 

(5 I I : r, y : r, r) ^ (B A i = y | r : r, T) if r is a value sort 

The following rules correspond to the detection of inconsistencies. 

{B\r) ^ L \f f-r -5 

(B\x:r,y:T'J) ^ ± if Hr 5 D r = y and r ^ r' 

(5 I r : r, y i z, r) ± if Hr 5 D r = y and r is a value sort 

The following property justifies the aJgorithm 

{B\T) J. if and only if Kr V-i(S A /\ c) 

B.2 The SFC case 

We represent an SFC as an unordered list of BFCs prefixed with a sign (4- or -); 
by definition, one and only one component is positive. Let 5 be an SFC. The 
SFC-aonnal form of S is written 5* and is obtained by the following algorithm: 

Let Co be the BFC-normal form of the positive component of 5. 
If Co = ± Then 

Return L 
Else 

Co is of the form {Bo \ To) 
. t'et {{Bi I Ti)}i~i^_^n be the list of negative components of 5. 
Foreach i= 1,...,7Z 

Let a be the BFC normal form of {Bo A 5,- | Fc, Ti). 
If there exists t € 1, . . . , n such that = (5 | F) and Hr B and F is empty Then 

Return ± 
Else 

Let / = {t € l,...,n such that 7^ X} 
Return {+c^,{-Ci 

The following property justifies the algorithm 

n 

[+{Bo I r,), {-{Bi I ri)}7=i]- = -L if and only if hr V-.[(S,A /\ c)A/\ -(5,A /\ c)] 

cer, «=i cgr; 
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Claims: 

1. A method carried out in a data processing device including a processor, memory, and a user 
interface, comprising: 

• (a) receiving at least one user input, the user input(s) defining at least one relation, the or each 
relation defining a document related entity and a property of the entity. 

(b) compiling a feature constraint, the feature constraint including the relation(s) received in step 

(a). 

2. The method of claim 1, wherein, prior to step (b), step (a) is repeated a plurality of times. 

3. The method of claim 1 or 2, wherein the data processing device comprising device comprises a 
portable computing device, and step (a) comprises receiving user inputs via touchscreen or a keyboard. 

- 4. The method of claim 1 or 2, wherein the data processing device comprising device comprises a 
fixed computing device, and step (a) comprises receiving user inputs via touchscreen, a keyboard, and/or 
mouse. 

5. The method of claim 4, wherein the fixed computing device comprises a multifunction device 
including scanning means, 

wherein step a) comprises: 

(a1) scanning an image-bearing portable medium to produce electronic signals corresponding to 
at least one predetermined portion of the medium; and 

(a2) determining from said signals a relation corresponding thereto. 

6. The method of claim. 4, wherein said medium bears, for each said predetermined portion, a 
human readable indication of the disposition of said predetermined portion and, associated therewith, a 
human readable designator of the relation corresponding to said predetermined portion, 

7. A method carried out in a data processing device including a processor, memory, and a user 
interface, comprising: 

(d) receiving a first user input designating a graphical object corresponding to a stored feature 
constraint, the feature constraint comprising one or more relations, the or each relation defining a 
document related entity and a property of the entity, 

(e) receiving a second user input indicating that the feature constraint is to be sent to another 
data processing device, 

(f) encoding the feature constraint in a data packet, and 

(g) transmitting the data packet. 
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8. A method carried out in a data processing device including a processor, memory, and a user 
interface, comprising: 

(h) receiving a data packet 

(i) decoding the data packet to derive a feature constraint, the feature constraint comprising one 
or more relations, the or each relation defining a document related entity and a property of the entity, 

(j) storing the feature constraint derived in step (i). 

9. A method carried out in a data processing device including a processor, memory, and a user 
interface, the data processing device being couple in a network to one or more other data processing 
devices, at least one of the data processing devices including means for storing a repository of electronic 
documents, comprising: 

(k) receiving at least one user input designating a feature constraint, the feature constraint 
comprising one or more relations, the or each relation defining a document related entity and a property of 
the entity. 

(1) solving the feature constraint to determine therefrom one or more document references, the or 
each document reference corresponding to a document within said repository satisfying said feature 
constraint. 

10. The method of claim, wherein step (k) comprises the method of claim 8. 

1 1 . The method of claim 9, further comprising the step of: 

(m) displaying the or each document reference determined in step (I). 

12. The method of claim 1 1 , further comprising the step of: 

(n) in response to a second user input designating one of the displayed document references,: 
retrieving the document corresponding to said document reference from said repository and, optionally, 
displaying said document or a portion thereof, 

13. The method of claim 11 or 12, further comprising the step of: 

(o) in response to a third user input, causing the document corresponding to said document 
reference to be printed. 

14. The method of any of the preceding claims, wherein said feature constraint comprises at least 
a positive component and a negative component, each of the positive component and the negative 
component including one or more relations. 

15. The method of any of the preceding claims, wherein one of said relations is a sort relation, the 
sort relation indicating the sort of which the respective entity comprises. 
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16. The method of any of the preceding claims, wherein one of said relations is a feature relation, 
the feature relation indicating that the respective entity has an attribute, the value of said attribute being a 
second entity. 

17. The method of any of the preceding claims, wherein the property is a value, or range of 
values, which said entity may have. 

18. A data processing device when suitably programmed for carrying out the method of any of the 
preceding claims, the device comprising a processor, a memory, and a user interface. 

19. A data processing device comprising: 
a processor, 

a memory coupled to the processor, and 

a user interface coupled to the processor and to the memory and adapted to be operable by a 
user to generate user inputs, 

means for receiving at least one user input, the user input(s) defining at least one relation, the or 
each relation defining a document related entity and a property of the entity, and for compiling a feature 
constraint, the feature constraint including the received relations. 

20. A system for accessing or distributing electronic documents, including: a repository of 
electronic documents, each document having a corresponding document reference, and a plurality of 
objects, at least one of said objects being portable or mobile, -each object including means for 
communicating with the or each other object and with a user interface, and means for receiving, storing 
and/or transmitting a feature constraint, the feature constraint comprising one or more relations, the or each 
relation defining a document related entity and a property of the entity. 

21. A portable device for accessing or distributing electronic documents, including: means for 
communicating with fixed or mobile electronic devices and with a user interface, at least one of said 
devices including means storing a repository of electronic documents, each document having a 
corresponding document reference, and means for receiving, storing and/or transmitting a feature 
constraint, the feature constraint comprising one or more relations, the or each relation defining a 
document related entity and a property of the entity, 

22. An apparatus for scanning, copying and/or printing documents, including: means for 
accessing a repository of electronic documents, each electronic document having a corresponding 
document reference, means for communicating with one or more of a plurality of objects, at least one of 
said objects being- portable or mobile, and with a user interface, and means for receiving, storing and/or 
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transmitting a feature constraint, the feature constraint comprising one or more relations, the or each 
relation defining a document related entity and a property of the entity. 



ABSTRACT 



FEATURE CONSTRAINT BASED DOCUMENT 
INFORMATION RETRIEVAL AND DISTRIBUTION 



A system including any number workstations, file servers, printers and other fixed devices coupled in 
a network, and a number of portable devices carried by users and coupled to the network by infrared (IR) link. 
Each portable device emulates its user's personal satchel for documents: the device is programmed to receive 
transmit and store feature constraints (PCs). The PCs are distributed from one person to another by 
transmission of IR data packets, and once a network-connected computer, printer or multifunction device has 
received a PC via IR transceiver associated with it, a search request can be derived from the PC and 
document references (e.g. WWW URLs) satisfying the PC can be obtained via a search engine. Por each URL 
obtained, the corresponding electronic document can be retrieved from a repository and displayed or printed.. 
The portable device is preferably a handheld or wristwatch computer with a graphical display for enabling the 
user to transfer PCs, and the fixed devices preferably include a scanner/copier/printer having its own IR 
transceiver. ' , 
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